Rebalancing the Load in Distributed File Systems

نویسندگان

  • C. Radhika
  • P. Ravindra
  • D. Krishna
چکیده

Distributed file systems are key building blocks for cloud computing applications supported the Map Reduce programming paradigm. In such file systems, nodes at the same time serve computing and storage functions; a file is partitioned off into variety of chunks allotted in distinct nodes so Map Reduce tasks are often performed in parallel over the nodes. However, in a cloud computing setting, failure is that the norm and nodes is also upgraded, replaced, and accessorial within the system. Files can even be dynamically created, deleted, and appended. This leads to load imbalance during a distributed file system; that's, the file chunks don't seem to be distributed as uniformly as doable among the nodes. Rising distributed file systems in production systems powerfully depend upon a central node for chunk reallocation. This dependence is clearly inadequate during a large-scale, failure-prone setting as a result of the central load balancer is anesthetize significant employment that's linearly scaled with the system size, and should therefore become the performance bottleneck and therefore the single purpose of failure. During this paper, a completely distributed load rebalancing formula is bestowed to cope with the load imbalance drawback. Our formula is compared against a centralized approach during a production system and a competitor distributed answer bestowed within the literature. The simulation results indicate that our proposal is comparable the prevailing centralized approach and significantly outperforms the previous distributed formula in terms of load imbalance issue, movement price, and algorithmic overhead. The performance of our proposal enforced within the Hadoop distributed filing system is additional investigated in a cluster setting. Index Words: Distributed files, map reduce, replaced, dependence, chunk reallocation.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Secure and Privacy-Preserving Distributed File Systems on Load Rebalancing in Cloud Computing

Distributed file systems in cloud computing because Google File System GFS and Hadoop Distributed file systems HDFS scheduled central servers in the direction of manage and load balancing in the metadata. Enabling technology for largescale computation for big data Originated by Google. Open source implementation by Yahoo and Facebook. Distributed file systems DFS are keys structure block for cl...

متن کامل

Stabilizing Load across Cloud for Distributed File Access

Distributed file systems in clouds such as GFS(Google File System) and HDFS(Hadoop Distributed file systems) rely on central servers to manage the metadata and the load balancing. DFS are keys building block for cloud computing application based on the reduce programming. In this file system node at the same time provide computing and storage functions, a file dived into a number of parts alloc...

متن کامل

Secure Load Rebalancing Algorithm for Distributed File Systems in Cloud

Distributed file systems are key technology for cloud computing applications. In such file system, each node having storage as well as computing functionalities. A file is partitioned into a number of chunks allocated in distinct nodes so that data processing can be performed in parallel. Specifically, in this study, we suggest offloading the load rebalancing task to storage nodes by having the...

متن کامل

Balancing the Load to Reduce Network Traffic in Private Cloud

Infrastructure-As-A-Service (IAAS) provides an environmental setup under anyone type of cloud. In Distributed file system (DFS), nodes simultaneously serve computing and storage functions; that is parallel Data processing and storage in cloud. Here, file is considered as a data. That file is partitioned into a number of chunks allocated in distinct nodes so that MapReduce tasks can be performed...

متن کامل

Water-Filling: A Novel Approach of Load Rebalancing for File Systems in Cloud

File systems serves as the backend for cloud computing and load balancing is the relevant issue in context of resource utilization for distributed file systems in cloud. Prior to this, it is fruitful to identify the load on the storage servers (nodes) which is equivalent to number of file chunks it stored. Here is an extension of load balancing i.e. water-filling load rebalancing operated on di...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014